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Abstract 

This paper introduces the theory and hardware im¬ 
plementation of two new algorithms for computing 
a single component of the discrete Fourier trans¬ 
form. In terms of multiplicative complexity, both 
algorithms are more efficient, in general, than the 
well known Goertzel Algorithm. 


1 Introduction 

Discrete transforms are mathematical tools used in 
many applications in Engineering. A particularly 
significant example is the discrete Fourier transform 
(DFT) [T]. Let v = (vn), n = 0,N — 1, he a 
sequence of complex numbers or of real numbers. 
The DFT of v is the sequence of complex numbers 
y = (Ife), fc = 0,..., A^ — 1, defined by 

N-l 

Vk=J2 (1) 

n—0 

where Wn = and j = -v/—1. 

The polynomial representation for an input signal 
V, denoted by v{x), is defined by 

N-l 

V{x) = ^ VnX'^. ( 2 ) 

n—0 

Therefore, the component 14 can be computed from 
v{x) by 

Ec = viW^). (3) 

From o, the computation of a single coefficient 
14 £ C, requires — 1 complex multiplications, 
A^ — 1 complex additions and the prestorage of the 
coefficients An algorithm to implement this 

computation, without the need for storing the co¬ 
efficients, was presented in [5]. The Goertzel algo¬ 
rithm, as it became known, computes the compo¬ 
nent 14 via the polynomial 

Pk{x) = {x-W^){x-W^'^) 

= l-2cos^^^ a;-f a;^ (4) 


which is the minimal polynomial oi over the 
field of real numbers. It is possible to write v(x) as 

v{x) = pk{x)q{x) -I- r{x), (5) 

where q{x) and r(x) are obtained by polynomial 
division. Since Pkix) has a zero in IF^, ([3]) can be 
used to derive 

Ec = r(VF^). (6) 

If V has real coefficients, the polynomial divi¬ 
sion by Pk{x) requires N — 2 real multiplications. 
Two real multiplications are necessary to compute 
r(IFj^), so that the Goertzel algorithm requires N 
real multiplications to compute one component of 
an A^-point DFT. The polynomial division can be 
implemented by an autoregressive filter, as shown 
in Figure [TJ 



Figure 1: Autoregressive filter to compute the poly¬ 
nomial division by Pk{x). The component vn-i is 
the first to be fed into the shift register circuit and 
A = 2cos{2TTk/N). The DFT component is com¬ 
puted by Vk =ro + nW^. 

Although the Goertzel algorithm can be used to 
compute the DFT of a given sequence, it is not a fast 
Fourier transform because its computational com¬ 
plexity, for an A^-point DFT, is proportional to N'^. 
It is an attractive procedure for application scenar¬ 
ios where only a few components (not more than 
log 2 N of the N components) of the DFT need to 
be computed, such as in the detection of DTMF 
signals [3]. 

Cyclotomic polynomials play an important role 
in the algorithms introduced in this paper. The N- 
th cyclotomic polynomial, denoted by $ 7 v(a:), is the 








monic polynomial which has as its roots all order N 
elements in C. Therefore 

^n{x)= n (^) 

e|ord(6l)=Ar 

and 

n ‘fd(x)= {x^ - 1 ). (8) 

d\N 

From the Mobius inversion formula [U, it can be 
shown that 

d|Af 


3 The JCO-Goertzel Algo¬ 

rithm 

The computation of R{x) from v{x) via the JCO 
algorithm is multiplication free. The polynomial 
R{x) in (I12p has degree < {4>{L) — 1) and can be 
written as 

R{x) = pkix)q{x) + r{x), (14) 

from which the 14 component can be computed by 
the Goertzel algorithm, as in ([H]). Therefore, the 
number of real multiplications in the JCO-Goertzel 
algorithm is (j){L). Due to the fact that 


where /i(n) is the Mobius function 


(j){L) <L<N, (15) 


I 1, if n = 1; 

p{n) = < 0, if 3ei > 2; (10) 

[ (—1)"*, otherwise, 

and n has the canonical factorization n = 
The degree of ^n{x) is given by 
(j){N), where is the Euler totient function |2|. 

In this paper, a new algorithm for computing a 
single DFT component, which is based on cyclo- 
tomic polynomials, is introduced in Section 11. This 
algorithm is then combined with the Goertzel algo¬ 
rithm in Section 111, to produce the most efficient 
algorithm, in terms of multiplicative complexity (as 
far as we know), for computing a single DFT com¬ 
ponent. In Section IV a hardware implementation 
for the discussed algorithms is proposed and the 
conclusions are presented in Section V. 


2 The JCO Algorithm 

The computation of a single DFT component, 14, 
using the JGO algorithm, considers the cyclotomic 
polynomial $i(x), where 

^ = 53^- <“> 

Then, by definition, $i(x) has a zero in and 
v{x) can be written as 

v{x) = $l(x)(5(x) -I- i?(x), (12) 

where i?(x) can be computed by an autoregressive 
filter and 

Vk = R{W^). (13) 

Unlike the Pkix) polynomial, 4>i(x) has integer 
coefficients which, for L smaller than 105, are equal 
to 0, 1 and -m- Therefore no multiplication is 
required to compute the polynomial division. The 
polynomial 4>i(x) has degree (l){L), so that 2[(j){L) — 
1] real multiplications are needed to compute 14 
using the JGO algorithm. 


it is clear that the JCO-Goertzel algorithm is more 
efficient, in terms of multiplicative complexity, than 
the Goertzel algorithm. 

Table [T] shows the multiplicative complexity (real 
multiplications) of the Goertzel, JCO and JCO- 
Goertzel algorithms, for some values of N and k, 
assuming that G R. When (/)(L) = 2, the cyclo¬ 
tomic polynomial $i(x) is equal to Pk{x) and the 
multiplication by the coefficient A = 2 cos(27rfc/7V) 
is a trivial one. Consequently, for L = 3,4, 6 in Ta- 
ble[TJ the algorithms present the same performance. 
From m, it is clear that the only case for which 
the Goertzel algorithm outperforms JCO is when 
L = N and V is a prime number, as indicated in 
Table □ for TV = 83. 


Table 1: Number of Real multiplications required 
to compute 14, for a length N real sequence u, for 


the Goertzel, JCO and JCO-Goertzel algorithms. 


N 

k 

Goertzel 

JCO 

JCO-Goertzel 

L 

12 

1 

12 

6 

4 

12 


2 

2 

2 

2 

6 


3 

2 

2 

2 

4 


4 

2 

2 

2 

3 

32 

1 

32 

30 

16 

32 


2 

32 

14 

8 

16 


3 

32 

30 

16 

32 


4 

32 

6 

4 

8 

48 

1 

48 

30 

16 

48 


2 

48 

14 

8 

24 


3 

48 

14 

8 

16 


4 

48 

6 

4 

12 

S3 

1,2,3,4 

S3 

162 

82 

83 

120 

1 

120 

62 

32 

120 


2 

120 

30 

16 

60 


3 

120 

30 

16 

40 


4 

120 

14 

8 

30 


4 Hardware Implementation 


The hardware implementation of the Goertzel algo¬ 
rithm can be made using the autoregressive filter 


Hiz) 


1 


(16) 















with input Vn, n = 0,..., N — 1 and output 2/„ 
The filter computes 

Vk = Vn- (17) 


To derive a hardware implementation of the JCO 
algorithm, H{z) is written as 


Hiz) 


n, 


ord{WJ:^)—L,i^k 




Y\.oid(Wi,)=L,i^k (l 
so that, from © and the $L(a:) symmetry, 

T{ovd(Wi,)=L,i^k (l “ 


H{z) = 




(18) 


(19) 


The degrees of the denominator and numerator 
polynomials are and (f>{L) — 1, respectively. 

Therefore, H{z) can be expressed in the form 


H{z) 


1 + aiz-i + ■. ■ + 

1 + + ... + 


( 20 ) 


Figure [2] shows the general hardware implementa¬ 
tion of the JCO algorithm. The multiplications by 
Gj need to be made once only and the multiplica¬ 
tions by bj are all trivial. An attractive aspect of 
this implementation is that the Vn components are 
fed into the shift register circuit in arrival order, 
thus requiring no components storage. 





Figure 2: Autoregressive filter to implement the 
JCO algorithm. The components Vn are fed into 
arrival order and M = 4>{L) — 1. The computed 
DFT component is Vk = Un- 


Example: To compute the component Fi 28 of 
a 1024-point DFT, it is necessary to determine the 
order of IF/qIj. Since IF 1024 = Ws, it has order 
eight. There are (/)(8) = 4 elements of order eight, 
namely, IFs, IFg , IFg and IFg . Therefore 

, (1 - Wsz-^){1 - lF|z-i)(l - lF|z-i) 

which leads to 


H{z) = 


^(1+j)^ 


■jz-2-^(l-j) 


^-3 


The desired DFT component is obtained from the 
filter output as F 128 = 2/1024 • The corresponding 
hardware implementation of the JCO algorithm is 
shown in Figure [3] The computation of F 128 re¬ 
quires only 2 multiplications and 1027 additions, in 
contrast to 1024 multiplications and 2049 additions 
as required by the Goertzel algorithm. 


•^1024 -^1023 ^ 1^0 



Figure 3: Example 1 JCO hardware implementa¬ 
tion, oi = -^(1 -I- j) and 03 = -^(—1 -I- j) = —a{. 


5 Conclusions 

In this paper two new algorithms for computing a 
single component of the discrete Fourier transform, 
the JCO and the JCO-Goertzel algorithms, are pro¬ 
posed. Both algorithms have, in general, a better 
performance in terms of computational complexity, 
when compared to the well known Goertzel algo¬ 
rithm, which is the standard procedure for this type 
of computation. In particular, the JGO-Goertzel al¬ 
gorithm has the lowest multiplicative complexity, as 
far as we know, of the algorithms that compute a 
single component of an A^-point DFT. 

The approach presented in this paper represents 
a change in paradigm with respect to the Goertzel 
method in the sense that, instead of using a fixed 
polynomial of degree 2, the cyclotomic polynomial 
$i(a:) is used. L is an integer that is a function of 
N, the DFT length, and k, the index of the DFT 
component to be computed. This means that dif¬ 
ferent components of the DFT will be computed 
with different complexities. Consequently, consid¬ 
ering that the JCO-Goertzel algorithm requires less 
than N multiplications for each DFT component 
computed, it can be used to compute an A^-point 
DFT with less than multiplications. 

Application scenarios that use the Goertzel algo¬ 
rithm will benefit from the techniques introduced in 
this paper m, 0. In the field of real numbers, for 
instance, the detection of DTMF signals is a typi¬ 
cal and important application [3] . In the finite field 
context, the syndrome computation in the decod¬ 
ing of a BCH code can be implemented by the Go¬ 
ertzel algorithm (TOj. Therefore, considering that a 
finite field version of the results presented here can 


I-kZ-4 































be derived following essentially the same approach, 
the proposed algorithms can be used to assist the 
decoding of such codes. 
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